Prosodic classification of discourse markers

نویسندگان

  • Vera Cabarrão
  • Helena Moniz
  • Jaime Ferreira
  • Fernando Batista
  • Isabel Trancoso
  • Ana Isabel Mata
  • Sérgio Curto
چکیده

The first contribution of this study is the description of the prosodic behavior of discourse markers present in two speech corpora of European Portuguese (EP) in different domains (university lectures, and map-task dialogues). The second contribution is a multiclass classification to verify, given their prosodic features, which words in both corpora are classified as discourse markers, which are disfluencies, and which correspond to words that are neither markers nor disfluencies (chunks). Our goal is to automatically predict discourse markers and include them in rich transcripts, along with other structural metadata events (e.g., disfluencies and punctuation marks) that are already encompassed in the language models of our in-house speech recognizer. Results show that the automatic classification of discourse markers is better for the lectures corpus (87%) than for the dialogue corpus (84%). Nonetheless, in both corpora, discourse markers are more easily confused with chunks than with disfluencies.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Prosody of Discourse Structure and Content in the Production of Persian EFL Learners

The present research addressed the prosodic realization of global and local text structure and content in the spoken discourse data produced by Persian EFL learners. Two newspaper articles were analyzed using Rhetorical Structure Theory. Based on these analyses, the global structure in terms of hierarchical level, the local structure in terms of the relative importance of text segments and the ...

متن کامل

Prosodic and Lexical Correlates of Swedish Discourse Markers in Spontaneous Dialogue

Discourse markers are words or phrases that speakers use at the beginning of a contribution to signal how it relates to prior discourse. They mark changes in the global discourse structure by e.g. signalling the beginning of a new topic or the return to a previous topic. However, words that are used as discourse markers often also have a sentential function. If discourse markers are to be used ...

متن کامل

Discourse Markers in Dialogue

Discourse markers are words or phrases that function as indicators of the structure of discourse by e.g. signalling the beginning of a new topic. This study reports on the Swedish discourse markers men ‘but’/‘and’ and så ‘so’, their functions and prosodic correlates in dialogue.

متن کامل

A method for automatic and dynamic estimation of discourse genre typology with prosodic features

This paper presents a work-in-progress on the automatic analysis of discourse genre in non-elicited speech. The study is focused on the development of bottom-up methods for automatic validation of discourse typologies found in linguistic descriptions (prosodic, syntactic, pragmatic and/or contextual and situational cues). The linguistic classification examined here opposes five discourse genres...

متن کامل

A Phonological Study on Japanese Discourse Markers

A spontaneously spoken, natural Japanese discourse contains many instances of the so-called redundant interjections and of backchannel utterances. These expressions have not hitherto received much attention and few systematic analyses have been made. We show that these utterances are characterizable as discourse markers, and that they comprise a well-defined category, characterizable in a regul...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015